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O , ' This paper describes briefly how one may utilize a class of species sampling mixture models derived from 

.^r ^ Doksum's (1974) neutral to the right processes. For practical implementation we describe an ordered/ranked 

' variant of the generalized weighted Chinese restaurant process. 

H 

. . The field of Bayesian nonparametric statistics essentially involves the idea of assigning prior and posterior 

1 -^ ' distributions over spaces of probability measures or more general measures. That is, similar to the classical 

C^ . parametric Bayesian idea of assigning priors to an unknown parameter, say 0, which lies in a Euclidean 

space, one views, for instance, an unknown cumulative distribution function, say F(t), as being a stochastic 
process. More generally for an unknown probability measure P, a Bayesian views it as a random probability 
measure. This is currently a well-developed and active area of research that has links to a variety of areas 
where Levy and more general random processes are commonly used. However, as discussed in Doksum and 
\^ ' James (2004), in the late 1960's, noting the high activity and advance in nonparametric statistics, David 

\^ . Blackwell and others wondered how one could assign priors which were both flexible and tractable. Arising 

OA ' from these questions were two viable answers which till this day remain at the cornerstone of Bayesian 

'nI , nonparametric statistics. 

vQ I Ferguson (1973, 1974) proposed the use of a Dirichlct process prior[see also Freedman (1963)]. For this 

f~^ ■ prior if P is a probability on some space ^, and {Bi, . . . , Bk) is a measurable partition of ^, then 

"^ I P{Bi), . . . ,P{Bk) has a Dirichlet distribution. Moreover, the posterior distribution of P given a sample 

.«_> ■ X = {Xi, . . . ,Xn) is also a Dirichlet process. For a specified probability measure H and a scalar 6 > 0, 

one can say that P :— Pen is a Dirichlet process with shape parameter 9H, if the Dirichlet distributions 



discussed above have parameters given by E[P(^i)] = 9H{Ai). Following this, Doksum (1974) introduced 

^ . the class of Neutral to the Right (NTR) random probability measures on the real line. For these models if 

J^ ' P is a distribution on the real line, then for each partition Pi, ... , B)^, with Bj — (sj_i, Sj], j — 1, . . . , fc, 

%> , So — —oOjSk — oo. Si < Sj for i < j; P(Pi), . . . , P(Pfc) is such that P{Bi) has the same distribution as 

^ ! ^i nj=i(l ~ ^O: where Vi, . . . , V2, ... is a collection of independent non-negative random variables. This 

represents a remarkably rich choice of models defined by specifying different distributions for the Vi . Notably 

if Vi is chosen to be beta random variable with parameters {ai^[3i) and /3i = X]7=i "^ji then this gives the 

Dirichlet process as described in Doksum (1974). Doksum (1974) shows that if P is a NTR distribution 

then the posterior distribution of P give a sample Xi , . . . , Xn is also an NTR. Subsequently, Ferguson and 

Phadia (1979), showed that this type of conjugacy property extends to the case of right censored survival 

models. This last fact coupled with the subsequent related works of Hjort (1990), Kim (1999), Lo (1993) and 

Walker and Muliere (1997) have popularized the usage of NTR processes in models related to survival and 

event history analysis. 
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2 NTR mixture models 

Despite these attractive points, the usage of NTR processes in more complex statistical models, such 
as mixture models, has been notably absent. This is in contrast to the Dirichlet process which, coupled 
with the advances in MCMC and other computational procedures, is regularly used in nonparametric or 
semi-parametric statistical models. The theoretical framework for Dirichlet process mixture models can be 
traced back to Lo (1984) who proposed to model a density as a convolution mixture model of a known kernel 
density K{y\x) and a Dirichlet process P as. 



(1) fiy\P) = / K{y\x)Pidx). 

J X 

This may be equivalently expressed in terms of a missing data model where for a sample Y = (Yi, . . . , Yn) 
based on Q, one has li, . . . , F„|X, P are such that Yi are independent with distributions K{-\Xi), Xi\P are 
iid P and P is a Dirichlet process. It is clear that the description of the posterior distribution of P and related 
quantities is much more complex than in the setting discussed in Ferguson (1973). However, Lo (1984) shows 
that its description is facilitated by the descriptions of the posterior distribution of P|X, given by Fergu- 
son (1973) and the exchangeable marginal distribution of X discussed in Blackwell and MacQueen (1973). 
Blackwell and Macqueen describe the distribution via what is known as the Blackwell-MacQueen Polya urn 
scheme where P(Xi ^ A) ~ H{A) and for n > 1 

(2) nxn e -ixi, . . . , x„_i) = J H{-) + ^ Y. ^^.(•)- 

+ n — 1 u + n— \ ^-^ 

Note that (0) clearly indicates that there are ties among [X\, . . . ,X„) and that the n{p) < n unique 
values, say X^, . . . , X*, s are iid with common distribution H. Letting p = {Ci, . . . , Cn(p)} denote a partition 
of the integers {1, . . . , n}, where one can write Cj = {i : Xi = X*}, with size Uj = \Cj\ for j = 1, . . . , n(p). 
This leads to the following important description of the distribution of X, 

"(p) 
■K{dX\9H) = PD(p|6') YI H{dX*) 
i=i 

where 

is a variant of Ewens sampling formula[see Ewens (1972) and Antoniak (1974)], often called the Chinese 
restaurant process. It can be interpreted as P(Ci, . . . , C„(p)) = pg{ni, . . . , Wnjp)) where pg, being symmetric 
in its arguments, is the most notable example of an exchangeable partition probability function(EiPT'F) [see 
Pitman (1996)]. It is easily seen that a Dirichlet Process with shape 9H is characterized by the pair {pg, H). 
Letting p(ni, . . . ,nfc), for n(p) = fc, denote an arbitrary EPPF, Pitman (1996) shows that the class of random 
probability measures whose distribution is completely determined by the pair (p, H) must correspond to the 
class of species sampling random probability measures. General species sampling random probability measures 
constitute all random probability measures that can be represented as 

oo oo 

(3) P(.)=^F.<5z.(-) + (l-E^^-)^(-) 

i=l fc=l 

where < P^ < 1 are random weights such that < X^i^i Pi ^ 1j independent of the Zi which are iid 
with some non-atomic distribution H . Furthermore the law of the {Pi) is determined by the EPPF p. Noting 
these points Ishwaran and James (2003) described the class of species sampling mixture models by replacing 
a Dirichlet process in (Q with P specified by Q. See also Miiller and Quintana (2004). 
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Except for the special case of the Dirichlet process, NTR processes are not species samphng models 
and this is one of the factors which makes analysis a bit more difficult. Nonetheless, James (2003, 2006) 
was able to extend the definition of NTR processes to a class of random probability measures on more 
general spaces, which he called Spatial NTR processes. Additionally a tractable description of the marginal 
distribution of this class of models was obtained. These two ingredients then allow for the implementation of 
NTR mixture models. Our goal in this note is not to describe the mechanisms for a full-blown NTR mixture 
model, as this requires much more overhead, but rather mixture models based on species sampling models 
which are derived from NTR processes. James (2003, 2006) introduced and calls these NTR species sampling 
models. Quite specifically, though the NTR processes are not species sampling models they produce EPPF's 
p that, along with the specification of H, are uniquely associated with an NTR species sampling model. 
This produces a very rich and flexible class of random priors that are a bit simpler analytically than NTR 
processes. An interesting fact is that this class contains the two-parameter (a, 9) Poisson-Dirichlet random 
probability measures for parameters < a < 1 and 9 > 0. That is the Dirichlet process and a class of 
random probabilities defined by normalizing a stable law process and further power tempering the stable law 
distribution, which are discussed in Pitman (1996) and Pitman and Yor (1997). Implementations of these 
latter models, being quite special, may be treated by computational procedures involving random partitions 
discussed in Ishwaran and James (2003) or by the methods in Ishwaran and James (2001). Here we will 
discuss a ranked weighted Chinese restaurant procedure which applies more generally. 



2. NTR and related processes. Let F{t) denote an NTR cumulative distribution function on the 
positive real line. Additionally, let S{t) = 1 — F{t) denote a survival function. Doksum (1974) Theorem 3.1 
shows that F is an NTR process if and only if it can be represented as 



(4) 



Fit) 



1 



-Y{t) 



where Y{t) is an independent increment process which is non-decreasing and right continuous almost surely 
and furthermore linit^oo Y{t) = c» and limt^_oo Y{t) — almost surely. In other words Y belongs to the 
class of positive Levy processes. We shall suppose hereafter that T is a positive random variable such that 
given F its distribution function is F where F is an NTR process. Then T has an interpretation as a survival 
time with "conditional" survival distribution S{t) = 1 - F{t) := P{T > t\F). It is evident from Q that the 
distribution of F is completely determined by the law of Y which is determined by its Laplace transform 



E 
where (j){uj\s) is equal to 

(5) r{l-e'nT{dv\s) 



,Y{t) 



-/Q*0(w|s)Ao(ds) :^ E\(S(t)y 



(1 - (1 - ur)p{du\s) 



LU{1 — u) 



p{dv\s) 



du 



Jo Jo Jo 

T and p are Levy densities on [0, oo] and [0, 1] respectively which are in correspondence via the mapping 
y —^ 1 — e~y . Without loss of generality we shall assume that /„ up{du\s) — 1 for each fixed s, which implies 
that 4){uj\s) = 1. Hence we have that 



E[S{t)] 



-Ao(t) :^l^Fo{t) 



where Fq represents one's prior belief about the true distribution and Ao((it) == Fo((it)/S'o(t— ) is its corre- 
sponding cumulative hazard with iS'o(i— ) = 1 — FQ{t—) = P(T > t). 

Note that for each fixed s, 4){io\s) corresponds to the log Laplace transform of an infinitely-divisible 
random variable. It follows that different specifications for r or equivalently p lead to different NTR processes. 
When r and p do not depend on s, F , Y and all relevant functionals are said to be homogeneous. We also 
apply this name to r and p. Additionally (/)(w|s) specializes to 



M:= 



fl 



)r(du) 



il~{l~ur)pidu) 



f 

Jo 



LU{1 — u) 



p{dv) 



du. 



4 NTR mixture models 

Consider now the cumulative hazard process of F, say A, defined by A{dt) = F{dt)/S{t—). The idea of 
Hjort (1990) was to work directly with A rather than F. He showed importantly that if one specified A to 
be a positive completely random measure on [0, 1], whose law is specified by the Laplace transform 

grg-wA(t)l ^ ^- Jg if>{u,\s)Ao{ds) 

where tp{uj\s) := /^ (1 — e^"")p(d-u|s), then F and S must be NTR processes specified by Q. James (2003, 
2006) shows that one can extend the definition of an NTR process to a spatial NTR process on [0, oo] x ^ 
by working with the concept of a random hazard measure, say Anidt, dx). Ah is a natural extension of A 
in the sense that Anidt, ^) — A{dt) and is otherwise specified by replacing the intensity p(du|s)Ao(ds) by 
p{du\s)Ao{ds,dx), where, 

Ao{ds,dx) = H{dx\s)Ao{ds) 

is a hazard measure and H{-\s) may be interpreted as the conditional distribution of X\T — s. A Spatial 
NTR process (SPNTR) is then defined as 

(6) Ps{dt,dx) = S{t-)AH{dt,dx) 

The SPNTR in © has marginals such that Ps{dt,d36') = F{dt) is an NTR and 

(7) Ps{[0,^),dx)^ S{t-)A„{ds,dx), 

Jo 

represents an entirely new class of random probability measures. 

2.1 NTR species sampling models 

NTR species sampling models arise as a special case of by setting H{dx\s) := H{dx). Here we will further 
work only with the class of homogeneous processes and hence we will additionally choose p{du\s) = p{du). 
Thus an NTR species sampling model is of the form 

/>oc OQ 

Pp,H{dx) = / S{s-)AH{ds,dx) = ^Pfc<5z,(dx). 
•^0 fc=l 

Furthermore, if _P = Pp^H the we denote its law as 3^{-\p, H). It follows that for practical usage in mixture 
models one needs a tractable description of the corresponding EPPF, say pp. However, before we do that 
we will need to introduce additional notation which connects Pp with the NTR process. If we suppose 
that Xi, . . . ,Xn\Pp.H are iid with distribution Pp^u, then these points come from a description of the n 
conditionally independent pairs (Ti, Xi), . . . , (T„, X„)|Ps where (Ti, Xi) are iid Ps, such that Ti are iid P, 
where F is an NTR, and Xi are iid Pp^H- Here Ps must be specified by the intensity p{du)Ao{ds)H{dx). 
Now if one denotes the n{p) unique pairs as {T*,X*) for j=l,. . . , n(p), then one may simply set each 
Cj ^ {i : Ti = T*}. Furthermore we define r(i:„) > T(2:„) > ... > T(n(p):n) > to be the ordered 
values of the unique values {T*)j<n{p)- Hence we can define p by setting Cj :~ {i : Ti = T*}, and define 
m = {Di, . . . , il'„(p\} with cells Dj = {i : Ti = Ttj.^\} with cardinality dj — \Dj\. It is evident that 
given a partition p — {Ci, . . . , C„(p)}, m takes its values over the symmetric group, say c5^„(p)5 of all n(p)! 



m 



permutations of p. Let Rj-i = Ufe=i ^fe -^ {i '■ Ti > T(^j;n)} with cardinality rj-i = X]fe=i'^'s- Then, 
terms of survival analysis, the quantities dj and tj — dj + rj^i have the interpretation as the number of 
deaths at time T^^j.^), and the number at risk at time T/^j-.n), respectively. See James (2006) for some further 
elaboration. Now from James (2003, 2006) it follows that 



(8) 7rp(p) =Pp(ni,...,n(p)) 



Y- llj 



n"(p) 



n- 



Lancelot F. James 
where, 



Jo 



The form of the EPPF is in general not directly tractable. However by augmentation one sees that the 
distribution of m is given by 

(9) TTpim) = -^ ^—^ 

and has a nice product form. This suggests that one can work with a joint distribution of (X, m) given by 

ri(p) 

^p(m) n HidX*). 
Related to this, James (2006) shows that a prediction rule of X„+i|X, m is given by 

n(p) T!(p) 

P(X„+i e da;|X,m) = (1 - JZ Pj:n)Po{dx) + ^ pj-.nSx^idx), 
with (1 - ^2"]=! Pj-n) = J2'j=i^^ 1]-n, and where 



Pr.n = 



n 



and 

_ Kl,r,_i(/0) nrif ^rfi.n-i + l(p) y^^ 0(r;) 

with (7„(p)+i:ri = Ki,n(/5)/<?!'(f^+l), are transition probabilities derived from 7rp(m). Note that in the calculation 
of Ki,r _i(p), Tj-i + 1 is to be used rather than rj = rj-i + nij. As an example, consider the choice 
of a homogeneous beta process [Hjort (1990), see also Ferguson (1974), Ferguson and Phadia (1979) and 
Gnedin (2004)] defined by 

p{du) = eu-^{i-u)'^-'^ 

then it is easily seen that 4){rj) — X];4i ^/(^ + ' ^ l)i and it follows that in this case 

dj "yr 4>{ri) _ _l 1 jT- (t^jri) 

^''" ~ " + ^ fi '^(^' + 1) *'" " " + ^ YZ^^' i/(^ + * - 1) 'i -^(^^ + 1) ■ 

Remark 1. Gnedin and Pitman (2005a) also obtained the expressions ^ and independent of 
James (2003, 2006), and in a different context. See James (2006) for more details. 

Remark 2. Related to this, Gnedin and Pitman (2005a) [see additionally Gnedin and Pitman (2005b)] 
showed that the EPPF in (jSJ corresponds to that of the two-parameter (a, 0) Poisson-Dirichlet process with 
parameters < a < 1 and > if p ;= pa,e is chosen such that, 

Paeidv) = ^. ^ — -u (1 — u) . 

'^ ■ ^ ' r(l-a)r(l + 6') ^ ' 



6 NTR mixture models 

From this, James (2006) deduced that Pp^ ^,h = Y^^=i ^k WiZi (1 ~ Wi)5zt, where {Wk) are independent 
beta (1 — a, 9~\-ka) random variables independent of the {Z}.) which are iid H. That is a two-parameter (a, 9) 
Poisson-Dirichlet process, for < a < 1 and 6* > can be represented as the marginal probability measure 
of a spatial NTR process, as described above. See Pitman and Yor (1997) and Ishwaran and James (2001) 
for more on the stick-breaking representation of the two parameter Poisson-Dirichlet process. 

3. NTR species sampling mixture models. Now setting P = Pp^H in (0 yields a special case of 
the species sampling models described in Ishwaran and James (2003). That is 



(10) / K{y\x)Pp^H{dx) ^ / K{y\x)S{s-)KH{ds,dx) 

is called an NTR species sampling models. We look at the situation where Yi, . . . , Yn\Pp,H are iid with density 
or pmf (|10|l . This translates into the hierarchical model, 

(11) X,\P - P 

P ^ ^{■\p,H) 

In principle, since we have a description of the EPPF, the theoretical results and computational procedures 
described in Ishwaran and James (2003) apply. However as we have noted in general TTp{p) is not as simple 
to work with as 7rp(m). So here we develop results that allows us to sample from a posterior distribution of 
m rather than partitions. We summarize these results in the next proposition 

Proposition 3.1 Suppose that one has the model specified in |i Jl) . Then the following results holds 

(i) The distribution of Xi, . . . , X„|Y, m is such that the unique values X* for j — 1, . . . , n{p) are condi- 
tionally independent with distributions 

7r{dX;\D,) o, HidX*) Y[ K{Y,\X*). 
(ii) The posterior distribution o/m|Y is, 



7rp(m|Y) (X TTpim) J] / II K (Y,\x) H (dx) . 



(Hi) The posterior distribution o/p|Y is 



Y, 'rp(mlY) = 7r,(p) H / H K iY,\x) H (dx) . 



D 
From this result one can compute a Bayesian predictive density of Y„-|_i|m, Y as, 



Z(n)-/(y„+i|m,Y) = 



»(p)+i 
Y. '^r-n 



/ K{Y„+i\x)Hidx) + y]p,.,„ f K{Yn+i\x)Tridx\D,). 



A Bayesian density estimate analogous to Lo (1984) is then to sum this expression relative to the distribution 

of mlY. 
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Corollary 3.1 Consider the model in Proposition 3.1, then a Bayesian predictive density estimator of 
y„_,_i|Y is given by 

E[/(r„+i|P)|Y]-^ J2 /(r„+i|m,Y)7rp(m|Y) 

p meS„(p) 

n 

3.0.1 Ordered/Ranked generalized weighted Chinese restaurant processes 

The significance of the expression for the predictive density, is that we can use l{n) in precisely the same 
manner as the predictive densities given p, Y, used in Ishwaran and James (2003) [see also Lo, Brunner and 
Chan (1996)] to construct computational procedures for approximating posterior quantities. In fact, all the 
major computational procedures for Dirichlet process mixture models, see for instance Escobar (1994) and 
Escobar and West (1995), utilize some type of predictive density. Here, in analogy to the gWCR algorithms 
in Lo, Brunner and Chan (1996) and Ishwaran and James (2003), we define a weighted version of the 
Ordered/ Ranked generalized Chinese restaurant process developed in James (2003, 2006), to approximate a 
draw from 7rp(ni|Y) as follows. For each n > 1, let {I?i:„, . . . , £'n(p):n}, denote a seating configuration of the 
first n customers, where Dj-n denotes the set of the n customers seated at a table with common rank j. 

(i) Given this configuration, the next customer n + 1 is seated at an occupied table Dj-n, denoting that 
customer n + 1 is equivalent to the jth largest seated customers, with probability. 



(12) ^ / K{Y„+^\x)7r{dx\D^^.,„-^) 



for j = l,...,7i(p). 

(ii) Otherwise, the probability that customer n + 1 is new and is the jth largest among n(p) + 1 possible 
ranks is, 

(13) ^ / K{Y,,+,\x)H{dx) 

for j = l,...,n(p) + 1. 

Similar to the gWCR SIS algorithms [see Ishwaran and James (2003, Lemma 2)], by appealing to the product 
rule of probability, repeating this procedure for customers {1, . . . , n}, produces a draw of m from a density 
of m depending on Y, say g(ni), that satisfies the relationship 

n(p) 



i(m)g(m) = 7rp(m) H / IT KY^\x)H{dx) 



j=l •' ^ i^Dj 



where L{ni) — Y[i=i '(* ~ !)• Hence for any functional, /i(m) it follows that 



^ V- ?,^ ^ ^ iv^ ^P^ms5„(,)^(m)£(m)g(m) 
E E Mm).,(m|Y)^ EpE....,,i^(m).(m) 



(14) E Z. Mm)^p(m|Y) = 

p me5'„(p) 



If the functional h{m) has a closed form, such as the predictive density E[/(j/|P)|m, Y] = /(y|m, Y), then 
one approximates (|14|l by using the rules in H12() and (|13|l to draw m. Repeating this procedure say B times, 
results in iid realizations say (ni(j-)) for b = 1, . . . , _B and one can approximate (|14|l by 

Ef=i'^(m(b))i(m(b)) 



Ef=ii(m(6)) 
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When the kernels K are set to 1, this procedure reduces to that described in James (2003, 2006) producing 
an exact draw from 7rp(m). For more intricate models one can incorporate a draw from the unique values 
X'^, . . . , X'\ . which has the same distribution that arises for the Dirichlet process. One can also incorporate 
draws from the posterior distribution of Pp_H{dx) which is described in James (2006). Otherwise it is a simple 
matter to modify all the computational procedures discussed in Ishwaran and James (2003, section 4). 

3.1 Normal Mixture example 

One of the most studied and utilized Bayesian mixture models is the Normal mixture model, specified by 
the choice of 



(15) 



Uy\P) 



CJ)a{v - x)P{dx) 



where 



bAz) = 



2-na 



exp 



z 
2^ 



is a Normal density, which is a natural candidate for density estimation. In the case of the Dirichlet process, 
this model was introduced by Lo (1984) and popularized by the development of feasible computational 
algorithms in Escobar (1994) and Escobar and West (1995). Suppose that (Yi) are iid with true density /o, a 
recent result of Lijoi, Priinster and Walker (2005) shows that /(j(-|P) in (|15|1 based on very general random 
probability measures, and a suitable prior distribution for a, have posterior distributions that are strongly 
consistent in terms of estimating the unknown density /o under rather mild conditions. In particular their 
result validates the use of rather arbitrary NTR species sampling models in this context with the classical 
choice of H set to be a Normal distribution with mean and variance A. Here setting a — \fB one has 

K{Y,\X,) = -^ exp [^^{Y, - X^f 

Using these specifications we present the details of the proposed algorithm. 

(i) Customer n + 1 is seated to a new table and assigned rank j among n(p) + 1 possible ranks with 
probability 



Qr. 



1 



: exp 






Xein + 1) ^2Tr{e + A) \ 2(0 + A), 

(ii) Customer n + 1 is seated to an existing table and is assigned rank j with probability 

^ Z^ieDj ^i + Yn+i A l^i^D. Yi 



Pj:n 



Ad, 



Ae(n + 1) V 2ne[e + A{dj + I)] 



exp 



'20 r"+' 



e + A{dj + 1) 



+ Ad, 



(iii) Additionally each X*|Y, m, is normally distributed with parameters 



1 



A 



and 



Mj 



- 'i E y.^ 



ieDi 



Xg{n + I) is the appropriate normalizing constant which is a special case of l(n). 

Remark 3. For comparison, the setup and notation we use is similar to that used in Ishwaran and 
James (2003, 6.1) which is based on weighted Chinese restaurant sampling of partitions p. 
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4. Concluding Remarks. We have given a brief account of how one can use Kjell Doksum's NTR 
models to create a new class of species sampling random probability measures which can be applied to 
complex mixture models. These models exhibit many features of the NTR models, in terms of clustering 
behavior, but as we have shown are simpler to use. Ideally one would like to describe parallel schemes for 
the more complex Spatial NTR models. However, this constitutes a considerably more involved study which 
we shall report elsewhere. More details can be found in James (2003, 2006) where explicit examples can be 
easily constructed. 

The representation in Q is important as it connects NTR processes to a large body of work on exponential 
functionals of Levy processes which have applications in many fields including physics and finance. For a 
recent survey see Bertoin and Yor (2005) . Some recent works which exploit this representation and are directly 
linked to NTR processes are Epifani, Lijoi and Priinster (2003) and James (2003, 2006). Additionally, outside 
of a Bayesian context, there is a notable body of recent work which has some overlaps with James (2003, 2006) 
and hence NTR processes by Gnedin and Pitman (2005a) and subsequent papers Gnedin and Pitman (2005b), 
Gnedin and Pitman and Yor (2005) and Gnedin, Pitman and Yor (2006). Although outside of a specific 
Bayesian context these papers contain results which are relevant to statistical analysis such as results related 
to the behavior of the number of ties n(p). The fact that these models arise from different considerations 
and different points of emphasis attests to their rich nature. We are quite interested to see what future 
connections will be made. 
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